Search CORE

507 research outputs found

The devices, experimental scaffolds, and biomaterials ontology (DEB): a tool for mapping, annotation, and analysis of biomaterials' data

Author: Blei D. M.
Chang P.
Deepu S.
FRA
Gene Ontology Consortium
Hassanpour S.
Kondylakis H.
Kononova O.
McGuinness D. L.
Noy N.
Rawat S.
Rees R.
Seppälä S.
Smola A.
Tchoua R. B.
Wang X. H.
Publication venue
Publication date: 01/01/2020
Field of study

The size and complexity of the biomaterials literature makes systematic data analysis an excruciating manual task. A practical solution is creating databases and information resources. Implant design and biomaterials research can greatly benefit from an open database for systematic data retrieval. Ontologies are pivotal to knowledge base creation, serving to represent and organize domain knowledge. To name but two examples, GO, the gene ontology, and CheBI, Chemical Entities of Biological Interest ontology and their associated databases are central resources to their respective research communities. The creation of the devices, experimental scaffolds, and biomaterials ontology (DEB), an open resource for organizing information about biomaterials, their design, manufacture, and biological testing, is described. It is developed using text analysis for identifying ontology terms from a biomaterials gold standard corpus, systematically curated to represent the domain's lexicon. Topics covered are validated by members of the biomaterials research community. The ontology may be used for searching terms, performing annotations for machine learning applications, standardized meta-data indexing, and other cross-disciplinary data exploitation. The input of the biomaterials community to this effort to create data-driven open-access research tools is encouraged and welcomed.Preprin

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Statistical Mechanical Development of a Sparse Bayesian Classifier

Author: Alon U.
Fisher R. A.
Gallager R. G.
Kabashima Y.
Kabashima Y.
MacKay D. J. C.
Malzahn D.
Mezard M.
Mezard M.
Neal R. M.
Nishimori H.
Nishimori H.
Opper M.
Pearl J.
Smola S. J.
Tipping M. E.
Vapnik V. N.
Watkin T. H.
Publication venue: 'Japan Society of Applied Physics'
Publication date: 21/10/2005
Field of study

The demand for extracting rules from high dimensional real world data is increasing in various fields. However, the possible redundancy of such data sometimes makes it difficult to obtain a good generalization ability for novel samples. To resolve this problem, we provide a scheme that reduces the effective dimensions of data by pruning redundant components for bicategorical classification based on the Bayesian framework. First, the potential of the proposed method is confirmed in ideal situations using the replica method. Unfortunately, performing the scheme exactly is computationally difficult. So, we next develop a tractable approximation algorithm, which turns out to offer nearly optimal performance in ideal cases when the system size is large. Finally, the efficacy of the developed classifier is experimentally examined for a real world problem of colon cancer classification, which shows that the developed method can be practically useful.Comment: 13 pages, 6 figure

arXiv.org e-Print Archive

Crossref

Reproducing Kernels of Generalized Sobolev Spaces via a Green Function Approach with Distributional Operators

Author: A. Berlinet
A. Bouhamidi
A. Bouhamidi
A.J. Smola
B. Schölkopf
D.G. Schweikert
E.M. Stein
G. Wahba
G.E. Fasshauer
Gregory E. Fasshauer
H. Wendland
J. Duchon
J. Kybic
M.D. Buhmann
M.L. Stein
Qi Ye
R. Schaback
R.A. Adams
W.A. Light
W.R. Madych
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/03/2013
Field of study

In this paper we introduce a generalized Sobolev space by defining a semi-inner product formulated in terms of a vector distributional operator

\mathbf{P}

consisting of finitely or countably many distributional operators

P_n

, which are defined on the dual space of the Schwartz space. The types of operators we consider include not only differential operators, but also more general distributional operators such as pseudo-differential operators. We deduce that a certain appropriate full-space Green function

G

with respect to

L:=\mathbf{P}^{\ast T}\mathbf{P}

now becomes a conditionally positive definite function. In order to support this claim we ensure that the distributional adjoint operator

\mathbf{P}^{\ast}

\mathbf{P}

is well-defined in the distributional sense. Under sufficient conditions, the native space (reproducing-kernel Hilbert space) associated with the Green function

G

can be isometrically embedded into or even be isometrically equivalent to a generalized Sobolev space. As an application, we take linear combinations of translates of the Green function with possibly added polynomial terms and construct a multivariate minimum-norm interpolant

s_{f,X}

to data values sampled from an unknown generalized Sobolev function

f

at data sites located in some set

X \subset \mathbb{R}^d

. We provide several examples, such as Mat\'ern kernels or Gaussian kernels, that illustrate how many reproducing-kernel Hilbert spaces of well-known reproducing kernels are isometrically equivalent to a generalized Sobolev space. These examples further illustrate how we can rescale the Sobolev spaces by the vector distributional operator

\mathbf{P}

. Introducing the notion of scale as part of the definition of a generalized Sobolev space may help us to choose the "best" kernel function for kernel-based approximation methods.Comment: Update version of the publish at Num. Math. closed to Qi Ye's Ph.D. thesis (\url{http://mypages.iit.edu/~qye3/PhdThesis-2012-AMS-QiYe-IIT.pdf}

arXiv.org e-Print Archive

Crossref

Angular sensitivity of blowfly photoreceptors: intracellular measurements and wave-optical predictions

Author: AW Snyder
C Pask
C Pask
C Pask
CB Boschek
D Marcuse
D. G. Stavenga
DG Stavenga
DG Stavenga
DGM Beersma
GA Horridge
H Muijser
J Scholes
J Schwemer
J Schwemer
J. G. J. Smakman
J. H. van Hateren
JGJ Smakman
JH Hateren van
JW Kuiper
K Kirschfeld
K Kirschfeld
K Vogt
KF Barrell
N Franceschini
P Streck
R Hengstenberg
RC Hardie
S Razmjoo
U Smola
Y Washizu
Publication venue
Publication date: 01/01/1984
Field of study

The angular sensitivity of blowfly photoreceptors was measured in detail at wavelengths λ = 355, 494 and 588 nm. The measured curves often showed numerous sidebands, indicating the importance of diffraction by the facet lens. The shape of the angular sensitivity profile is dependent on wavelength. The main peak of the angular sensitivities at the shorter wavelengths was flattened. This phenomenon as well as the overall shape of the main peak can be quantitatively described by a wave-optical theory using realistic values for the optical parameters of the lens-photoreceptor system. At a constant response level of 6 mV (almost dark adapted), the visual acuity of the peripheral cells R1-6 is at longer wavelengths mainly diffraction limited, while at shorter wavelengths the visual acuity is limited by the waveguide properties of the rhabdomere. Closure of the pupil narrows the angular sensitivity profile at the shorter wavelengths. This effect can be fully described by assuming that the intracellular pupil progressively absorbs light from the higher order modes. In light-adapted cells R1-6 the visual acuity is mainly diffraction limited at all wavelengths.

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

University of Groningen Digital Archive

Dissertations of the University of Groningen

Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy

Author: De S
Gretton A
Ramdas A
Smola A
Strathmann H
Sutherland DJ
Tung H-Y
Publication venue: 5th International Conference on Learning Representations
Publication date
Field of study

We propose a method to optimize the representation and distinguishability of samples from two probability distributions, by maximizing the estimated power of a statistical test based on the maximum mean discrepancy (MMD). This optimized MMD is applied to the setting of unsupervised learning by generative adversarial networks (GAN), in which a model attempts to generate realistic samples, and a discriminator attempts to tell these apart from data samples. In this context, the MMD may be used in two roles: first, as a discriminator, either directly on the samples, or on features of the samples. Second, the MMD can be used to evaluate the performance of a generative model, by testing the model's samples against a reference data set. In the latter role, the optimized MMD is particularly helpful, as it gives an interpretable indication of how the model and data distributions differ, even in cases where individual model samples are not easily distinguished either by eye or by classifier

UCL Discovery

A Novel Visual Word Co-occurrence Model for Person Re-identification

Author: AJ Smola
B Hariharan
C Liu
D Gray
J Gemert van
L Bazzani
M Dikmen
N Gheissari
ND Bird
O Javed
PF Felzenszwalb
RE Fan
T Jebara
V-H Nguyen
W Li
WS Zheng
Publication venue
Publication date: 23/10/2014
Field of study

Person re-identification aims to maintain the identity of an individual in diverse locations through different non-overlapping camera views. The problem is fundamentally challenging due to appearance variations resulting from differing poses, illumination and configurations of camera views. To deal with these difficulties, we propose a novel visual word co-occurrence model. We first map each pixel of an image to a visual word using a codebook, which is learned in an unsupervised manner. The appearance transformation between camera views is encoded by a co-occurrence matrix of visual word joint distributions in probe and gallery images. Our appearance model naturally accounts for spatial similarities and variations caused by pose, illumination & configuration change across camera views. Linear SVMs are then trained as classifiers using these co-occurrence descriptors. On the VIPeR and CUHK Campus benchmark datasets, our method achieves 83.86% and 85.49% at rank-15 on the Cumulative Match Characteristic (CMC) curves, and beats the state-of-the-art results by 10.44% and 22.27%.Comment: Accepted at ECCV Workshop on Visual Surveillance and Re-Identification, 201

arXiv.org e-Print Archive

Crossref

Weakly supervised approaches for quality estimation

Author: A Smola
C Schütze
Carl Vogel
Erwan Moreau
H Wickham
M Hall
S Shevade
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Using data mining for wine quality assessment

Author: A. Smola
B. Boser
D. Rumelhart
D. Smith
E. Turban
H. Yu
I. Guyon
I. Moreno
I.H. Witten
L. Sun
M. Yu
P. Cortez
R. Kewley
S. Ebeler
S. Kramer
T. Dietterich
T. Hastie
V. Cherkassy
W. Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Certiﬁcation and quality assessment are crucial issues within the wine industry. Currently, wine quality is mostly assessed by physico- chemical (e.g alcohol levels) and sensory (e.g. human expert evaluation) tests. In this paper, we propose a data mining approach to predict wine preferences that is based on easily available analytical tests at the certiﬁ- cation step. A large dataset is considered with white vinho verde samples from the Minho region of Portugal. Wine quality is modeled under a re- gression approach, which preserves the order of the grades. Explanatory knowledge is given in terms of a sensitivity analysis, which measures the response changes when a given input variable is varied through its do- main. Three regression techniques were applied, under a computationally efficient procedure that performs simultaneous variable and model selec- tion and that is guided by the sensitivity analysis. The support vector machine achieved promising results, outperforming the multiple regres- sion and neural network methods. Such model is useful for understand- ing how physicochemical tests affect the sensory preferences. Moreover, it can support the wine expert evaluations and ultimately improve the production

Universidade do Minho: RepositoriUM

Crossref

Modelling Issues in Kernel Ridge Regression

Author: A B Kock
A J Smola
B Sch�lkopf
D S Broomhead
G C Cawley
G S Kimeldorf
H Raiffa
H White
J H Stock
J H Stock
J Mercer
K Yao
M C Medeiros
N Aronszajn
P Exterkate
Peter Exterkate
S Bochner
S C Ludvigson
S C Ludvigson
T Hofmann
T Poggio
T Ter�svirta
T Ter�svirta
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

Crossref

Elastic Maps and Nets for Approximating Principal Manifolds and Their Application to Microarray Data Visualization

Author: A Gorban
A Gorban
A Gusev
A Zinovyev
A. N. Gorban
AJ Smola
AJ Smola
AN Gorban
AN Gorban
AN Gorban
AN Gorban
AN Gorban
AN Gorban
AN Gorban
AN Gorban
AN Gorban
AY Zinovyev
AY Zinovyev
B Kégl
B Kégl
B Mirkin
B Schölkopf
CM Bishop
CM Perou
D Stanford
DG Kendall
E Erwin
F Mulier
H Ritter
H Yin
H Yin
H Zou
JB Tenenbaum
JD Banfield
K Pearson
Kégl
L Aizenberg
L Dyrskjot
M Born
M Frećhet
M LeBlanc
M Oja
R Durbin
R Sayle
R Shyamsundar
S Kaski
S Roweis
T Hastie
T Hastie
T Kohonen
VA Dergachev
W Cai
Y Wang
YF Leung
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/12/2007
Field of study

Principal manifolds are defined as lines or surfaces passing through ``the middle'' of data distribution. Linear principal manifolds (Principal Components Analysis) are routinely used for dimension reduction, noise filtering and data visualization. Recently, methods for constructing non-linear principal manifolds were proposed, including our elastic maps approach which is based on a physical analogy with elastic membranes. We have developed a general geometric framework for constructing ``principal objects'' of various dimensions and topologies with the simplest quadratic form of the smoothness penalty which allows very effective parallel implementations. Our approach is implemented in three programming languages (C++, Java and Delphi) with two graphical user interfaces (VidaExpert http://bioinfo.curie.fr/projects/vidaexpert and ViMiDa http://bioinfo-out.curie.fr/projects/vimida applications). In this paper we overview the method of elastic maps and present in detail one of its major applications: the visualization of microarray data in bioinformatics. We show that the method of elastic maps outperforms linear PCA in terms of data approximation, representation of between-point distance structure, preservation of local point neighborhood and representing point classes in low-dimensional spaces.Comment: 35 pages 10 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref